Multiple fundamental frequency estimation based on sparse representations in a structured dictionary

Authors

  • Michal Genussov
  • Israel Cohen
Abstract

Automatic transcription of polyphonic music is an important task in audio signal processing, which involves identifying the fundamental frequencies (pitches) of several notes played at a time. Its difficulty stems from the fact that harmonics of different notes tend to overlap, especially in Western music. This causes a problem in assigning the harmonics to their true fundamental frequencies, and in deducing spectra of several notes from their sum. We present here a multi-pitch estimation algorithm based on sparse representations in a structured dictionary, suitable for the spectra of music signals. In the vectors of this dictionary, most of the elements are forced to be zero except the elements that represent the fundamental frequencies and their harmonics. Thanks to the structured dictionary, the algorithm does not require a diverse or a large dataset for training and is computationally more efficient than alternative methods. The performance of the proposed structured dictionary transcription system is empirically examined, and its advantage is demonstrated compared to alternative dictionary learning methods.
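
The idea of a structured dictionary, whose vectors are zero everywhere except at a fundamental frequency and its harmonics, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify how atoms are learned or how the sparse code is computed, so the sketch below fixes the harmonic amplitudes at 1/h and swaps in a simple greedy matching pursuit. All names (`midi_to_hz`, `harmonic_atom`, `build_dictionary`, `greedy_pitches`), the 5 Hz bin width, the six harmonics, and the note range are assumptions chosen only for illustration.

```python
import math


def midi_to_hz(midi):
    """Equal-tempered frequency of a MIDI note number (A4 = 69 = 440 Hz)."""
    return 440.0 * 2.0 ** ((midi - 69) / 12.0)


def harmonic_atom(f0, n_bins=1600, bin_hz=5.0, n_harmonics=6):
    """Unit-norm spectrum atom that is zero except at the bins of f0 and its harmonics.

    Assumption: harmonic h gets amplitude 1/h. Assumes f0 lies inside the
    covered band, so at least one bin is nonzero.
    """
    atom = [0.0] * n_bins
    for h in range(1, n_harmonics + 1):
        b = int(h * f0 / bin_hz + 0.5)  # nearest frequency bin
        if b < n_bins:                  # skip harmonics above the covered band
            atom[b] += 1.0 / h
    norm = math.sqrt(sum(v * v for v in atom))
    return [v / norm for v in atom]


def build_dictionary(midi_notes, **atom_kwargs):
    """Map each candidate MIDI note to its structured (harmonic) atom."""
    return {m: harmonic_atom(midi_to_hz(m), **atom_kwargs) for m in midi_notes}


def greedy_pitches(spectrum, dictionary, n_notes):
    """Pick n_notes atoms greedily (matching pursuit on a magnitude spectrum).

    Stand-in for a sparse coder: at each step, select the unused atom with the
    largest inner product with the residual, subtract its contribution, and
    clip the residual at zero because magnitudes cannot be negative.
    """
    residual = list(spectrum)
    chosen = []
    for _ in range(n_notes):
        best, best_c = None, 0.0
        for m, atom in dictionary.items():
            if m in chosen:
                continue
            c = sum(r * a for r, a in zip(residual, atom))
            if c > best_c:
                best, best_c = m, c
        if best is None:  # nothing correlates with the residual any more
            break
        chosen.append(best)
        atom = dictionary[best]
        residual = [max(r - best_c * a, 0.0) for r, a in zip(residual, atom)]
    return sorted(chosen)


if __name__ == "__main__":
    # C4 (60) and G4 (67): the 3rd harmonic of C4 and the 2nd of G4 share a bin.
    d = build_dictionary(range(48, 85))
    mix = [x + 0.7 * y for x, y in zip(d[60], d[67])]
    print(greedy_pitches(mix, d, 2))
```

The C4/G4 pair in the usage example is chosen because its harmonics overlap, which is the difficulty the abstract describes; constraining each atom to a harmonic support is what lets the overlapping energy be attributed to a small set of candidate notes.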

Similar resources

Compressive Parameter Estimation with Earth Mover’s Distance via K-Median Clustering

In recent years, sparsity and compressive sensing have attracted significant attention in parameter estimation tasks, including frequency estimation, delay estimation, and localization. Parametric dictionaries collect observations for a sampling of the parameter space and can yield sparse representations for the signals of interest when the sampling is sufficiently dense. While this dense samplin...

Wideband DOA Estimation via Sparse Bayesian Learning over a Khatri-Rao Dictionary

This paper deals with wideband direction-of-arrival (DOA) estimation by exploiting the multiple measurement vectors (MMV) based sparse Bayesian learning (SBL) framework. First, the array covariance matrices at different frequency bins are focused to the reference frequency by the conventional focusing technique and then transformed into the vector form. Then a matrix called the Khatri-Rao di...

Compressive parameter estimation via K-median clustering

In recent years, compressive sensing (CS) has attracted significant attention in parameter estimation tasks, including frequency estimation, time delay estimation, and localization. In order to use CS in parameter estimation, parametric dictionaries (PDs) collect observations for a sampling of the parameter space and yield sparse representations for signals of interest when the sampling is suff...

Compressive Parameter Estimation with EMD

Thesis, February 2014. Dian Mo (B.Sc., Beihang University; M.S.E.C.E., University of Massachusetts Amherst). Directed by: Professor Marco F. Duarte. In recent years, sparsity and compressive sensing have attracted significant attention in parameter estimation tasks, including frequency estimation, delay estimation, and localization. Parametric dictionaries collect si...

Speech Enhancement using Adaptive Data-Based Dictionary Learning

This paper presents a speech enhancement method based on sparse representation of data frames. Speech enhancement is among the most widely applied areas of signal processing. The objective of a speech enhancement system is to improve either the intelligibility or the quality of speech signals. This process is carried out using the speech signal processing techniques ...

Journal:
  • Digital Signal Processing

Volume 23  Issue 

Pages  -

Publication date: 2013